Paper Number Paper Title
1315 "IT IS OKAY TO BE UNCOMMON": QUANTIZING SOUND EVENT DETECTION NETWORKS ON HARDWARE ACCELERATORS WITH UNCOMMON SUB-BYTE SUPPORT
3377 1-D SPATIAL ATTENTION IN BINARIZED CONVOLUTIONAL NEURAL NETWORKS
4343 2D Human Pose Estimation Calibration and Keypoint Visibility Classification
9562 3D AUTOMATED QUANTITATIVE CALCULATIONS BASED ON CT IMAGES OF THE HIP JOINT
9987 3D Hand Joint and Grasping Estimation for Teleoperation System
5765 3D PARALLELISM FOR TRANSFORMERS VIA INTEGER PROGRAMMING
10067 3D POINT CLOUD SEMANTIC SEGMENTATION BASED ON DIFFUSION MODEL
6588 3D POSE ESTIMATION FROM MONOCULAR VIDEO WITH CAMERA-BONE ANGLE REGULARIZATION ON THE IMAGE FEATURE
2858 3DSAM: SEGMENT ANYTHING IN NERF
4353 3M-TRANSFORMER: A MULTI-STAGE MULTI-STREAM MULTIMODAL TRANSFORMER FOR EMBODIED TURN-TAKING PREDICTION
2738 3S-TSE: EFFICIENT THREE-STAGE TARGET SPEAKER EXTRACTION FOR REAL-TIME AND LOW-RESOURCE APPLICATIONS
9281 6DOF SELD: SOUND EVENT LOCALIZATION AND DETECTION USING MICROPHONES AND MOTION TRACKING SENSORS ON SELF-MOTIONING HUMAN
10265 A 3D VIRTUAL TRY-ON METHOD WITH GLOBAL-LOCAL ALIGNMENT AND DIFFUSION MODEL
7550 A BAYESIAN APPROACH TO HIGH-ORDER LINK PREDICTION
4797 A BINARY BP DECODING USING POSTERIOR ADJUSTMENT FOR QUANTUM LDPC CODES
3674 A BI-PYRAMID MULTIMODAL FUSION METHOD FOR THE DIAGNOSIS OF BIPOLAR DISORDERS
8415 A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
2465 A CCM-BASED JOINT DOA-FREQUENCY ESTIMATION AND SIGNAL RECOVERY WITH EFFICIENT SUB-NYQUIST SAMPLING
4570 A Chat About Boring Problems: Studying GPT-based text normalization
4413 A Closer Look at Wav2Vec2 Embeddings for On-device Single-channel Speech Enhancement
7266 A CODEC-BASED APPROACH FOR VIDEO LIFE-CYCLE CHARACTERIZATION IN SOCIAL NETWORKS
8328 A COMPARATIVE ANALYSIS OF POETRY READING AUDIO: SINGING, NARRATING, OR SOMEWHERE IN BETWEEN?
6150 A COMPARATIVE STUDY ON ANNOTATION QUALITY OF CROWDSOURCING AND LLM VIA LABEL AGGREGATION
7551 A COMPARISON OF PARAMETER-EFFICIENT ASR DOMAIN ADAPTATION METHODS FOR UNIVERSAL SPEECH AND LANGUAGE MODELS
10309 A complete method for the 3D reconstruction of axonal pathways from 2 orthogonal 3D OCT images of the lamina cribrosa
1527 A COMPREHENSIVE ANALYSIS OF BIASES AND CUES IN NLU DATASETS AND MODELS WITH ICQ
4480 A COMPREHENSIVE FRAMEWORK FOR OCCLUDED HUMAN POSE ESTIMATION
4430 A COMPUTATIONALLY EFFICIENT SEMI-BLIND SOURCE SEPARATION APPROACH FOR NONLINEAR ECHO CANCELLATION BASED ON AN ELEMENT-WISE ITERATIVE SOURCE STEERING
8850 A CONCEPT FOR A SLAM BACK END HARDWARE ACCELERATOR
2992 A CONTRARIO PARADIGM FOR YOLO-BASED INFRARED SMALL TARGET DETECTION
7405 A Contrast Embedding Based Domain Adaptation Network for Singing Melody Extraction
4762 A CONVERGENT PRIMAL-DUAL DEEP PLUG-AND-PLAY ALGORITHM FOR CONSTRAINED IMAGE RESTORATION
4905 A CROSS SEARCH METHOD FOR DATA AUGMENTATION IN NEURAL MACHINE TRANSLATION
1858 A crowdsourcing approach to video quality assessment
4646 A Deep Representation Learning-based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder
6415 A DENSENET-BASED METHOD FOR DECODING AUDITORY SPATIAL ATTENTION WITH EEG
1378 A DENSITY-GUIDED TEMPORAL ATTENTION TRANSFORMER FOR INDISCERNIBLE OBJECT COUNTING IN UNDERWATER VIDEOS
7461 A DETAILED AUDIO-TEXT DATA SIMULATION PIPELINE USING SINGLE-EVENT SOUNDS
2700 A DISTRIBUTED JOINT INTEGRATED PROBABILISTIC DATA ASSOCIATION (JIPDA) FILTER WITH SOFT OBJECT ASSOCIATION
8136 A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION
3263 A FACIAL EXPRESSION TRANSFER METHOD BASED ON 3DMM AND DIFFUSION MODELS
4860 A FAST BLIND DEBLURRING ALGORITHM USING LOCAL GRADIENT PRODUCT PRIOR
7284 A FAST, PERFORMANT, SECURE DISTRIBUTED TRAINING FRAMEWORK FOR LLM
7167 A FEDERATED GRAPH TO EMBEDDING APPROACH FOR KNOWLEDGE GRAPH COMPLETION
3175 A FINE-GRAINED TRI-MODAL INTERACTION MODEL FOR MULTIMODAL SENTIMENT ANALYSIS
5517 A FLEXIBLE ONLINE FRAMEWORK FOR PROJECTION-BASED STFT PHASE RETRIEVAL
7959 A FOUNDATION MODEL FOR MUSIC INFORMATICS
5969 A FRAMEWORK FOR PORTRAIT STYLIZATION WITH SKIN-TONE AWARENESS AND NUDITY IDENTIFICATION
6563 A fully differentiable model for unsupervised singing voice separation
5990 A GENERAL FRAMEWORK FOR ROTATION INVARIANT POINT CLOUD ANALYSIS
7292 A Generative Adversarial Framework for Dialogue Generation with Neural Architecture Search
7381 A GIBBS SAMPLER FOR BAYESIAN NONPARAMETRIC STATE-SPACE MODELS
9005 A GRAPH NEURAL NETWORK BASED APPROACH FOR FAULT DELINEATION IN SEISMIC DATA USING GRAPH TOTAL VARIATION AND MULTIGRAPH
1578 A GRAPH NEURAL NETWORK BASED FUSION OF MRI-DERIVED BRAIN NETWORK AND CLINICAL DATA FOR GLIOBLASTOMA SURVIVAL PREDICTION
8207 A GRAPH-PREDICTION-BASED APPROACH FOR DEBIASING UNDERREPORTED DATA
3728 A GUIDED UPSAMPLING NETWORK FOR SHORT WAVE INFRARED IMAGES USING GRAPH REGULARIZATION
1469 A Hierarchical multi-proxy Loss with Dynamic Main-proxy for Deep Metric Learning
3912 A HYBRID CNN-TRANSFORMER FOR FOCAL LIVER LESION CLASSIFICATION
8486 A HYBRID DEEP-ONLINE LEARNING BASED METHOD FOR ACTIVE NOISE CONTROL IN WAVE DOMAIN
2302 A JOINT DATA COMPRESSION AND TIME-DELAY ESTIMATION METHOD FOR DISTRIBUTED SYSTEMS VIA EXTREMUM ENCODING
4796 A JOINT LOOK ON LUNAR SATELLITE AND COOPERATIVE SURFACE PNT
3715 A Learning Resource Recommendation Algorithm Based on Online Learning Behavior
6868 A LEARNING-BASED MULTI-NODE FUSION POSITIONING METHOD USING WEARABLE INERTIAL SENSORS
3180 A LEARNING-BASED SYSTEM FOR AUTOMATIC DECEPTION DETECTION FROM DOSING VIDEOS
3481 A Lightweight Change Detection Method Based on Feature Interaction and Transformer for High Resolution Remote Sensing Images
10170 A LIGHTWEIGHT HYBRID MULTI-CHANNEL SPEECH EXTRACTION SYSTEM WITH DIRECTIONAL VOICE ACTIVITY DETECTION
2931 A LIGHT-WEIGHT STATE DETECTION MODEL FOR KALMAN-FILTER-BASED ACOUSTIC FEEDBACK CANCELLATION WITH RAPID RECOVERY FROM ABRUPT PATH CHANGES
8389 A LOW-LATENCY FFT-IFFT CASCADE ARCHITECTURE
8111 A Machine-Learning Model for Detecting Depression, Anxiety, and Stress from Speech
6005 A META-PRECONDITIONING APPROACH FOR DEEP Q-LEARNING
3105 A METHOD FOR BILEVEL OPTIMIZATION WITH CONVEX LOWER-LEVEL PROBLEM
3053 A METHOD FOR X-RAY IMAGE LANDMARKS LOCALIZATION USING CYCLIC COORDINATE-GUIDED STRATEGY
7492 A MODIFIED CRAMÉR-RAO BOUND FOR DISCRETE-TIME MARKOVIAN DYNAMIC SYSTEMS
1493 A MULTI-CARRIER INFORMATION HIDING ALGORITHM BASED ON LAYERED COMPRESSION OF 3D POINT CLOUD MODEL
9344 A MULTIMODAL ADAPTIVE COOPERATIVE LEARNING METHOD FOR CANCER SURVIVAL RISK PREDICTION
5867 A MULTI-SCALE BIMODAL FUSION NETWORK FOR ROBUST AND ACCURATE ONLINE HANDWRITING RECOGNITION
4185 A MULTISCALE OBJECTIVE FUNCTION FOR CAMERA COLOR CORRECTION
7410 A Neural Syntax Parser for Coronary Artery Anatomical Labeling in Coronary CT Angiography
7347 A Neurophysiological-Auditory "Listen Receipt" for Communication Enhancement
1565 A New Fourth-Order Sparse Array Generator Based on Sum-Difference Co-array Analysis
4557 A New Perspective on Understanding Resolution Limit via An Asymptotic Study of Christoffel-Darboux Kernel based Spectrum Estimator
9353 A New Pre-training Paradigm for Offline Multi-agent Reinforcement Learning with Suboptimal Data
5016 A new similarity-based relational knowledge distillation method
5623 A NOVEL 3-D FOCUSING SCHEME FOR DISTRIBUTED SAR TOMOGRAPHY
7147 A NOVEL ARCHITECTURE OF DEEP FEATURE-BASED GAUSSIAN PROCESSES WITH AN ENSEMBLE OF KERNELS
7183 A NOVEL CASCADE INSTRUCTION TUNING METHOD FOR BIOMEDICAL NER
5558 A Novel Contrastive Diffusion Graph Convolutional Network for Few-Shot Skeleton-Based Action Recognition


10198 A NOVEL CROSS-SENSOR SELF-SUPERVISED LEARNING METHOD FOR ROTATING MACHINERY FAULT DIAGNOSIS
1907 A NOVEL DEMODULATION AND SELECTION PILOT POWER TRADE-OFF FOR CODEBOOK-BASED IRS WITH IMPERFECT CHANNEL ESTIMATES
8073 A NOVEL DISCRETE FRACTIONAL COMPLEX HADAMARD TRANSFORM FOR MEDICAL IMAGE ENCRYPTION
7081 A NOVEL ITERATIVE THRESHOLDING ALGORITHM FOR ARCTANGENT REGULARIZATION PROBLEM
5357 A NOVEL LOCAL-GLOBAL FEATURE FUSION FRAMEWORK FOR BODY-WEIGHT EXERCISE RECOGNITION WITH PRESSURE MAPPING SENSORS
6674 A NOVEL MEDICAL IMAGE FUSION FRAMEWORK INTEGRATING MULTI-SCALE ENCODER-DECODER WITH DISCRETE WAVELET DECOMPOSITION
9506 A Novel Multi-atlas Fusion Model Based On Contrastive Learning For Functional Connectivity Graph Diagnosis
8931 A NOVEL MULTIMODAL SENTIMENT ANALYSIS MODEL BASED ON GATED FUSION AND MULTI-TASK LEARNING
3150 A NOVEL RESIDUAL-GUIDED LEARNING METHOD FOR IMAGE STEGANOGRAPHY
5134 A One-Class Approach to Detect Super-Resolution Satellite Imagery with Spectral Features
8638 A parameterized generative adversarial network using cyclic projection for explainable medical image classifications
6132 A PLS-INTEGRATED LASSO METHOD WITH APPLICATION IN INDEX TRACKING
3216 A PRACTICAL ONLINE MULTICHANNEL DEREVERBERATION APPROACH WITH DATA-REUSE TECHNIQUE


7041 A PRIOR DRIVEN SEMI-SUPERVISED VITGAN FOR IMAGE RECOLORIZATION
2226 A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA
4564 A Prompt-based Method With Multi-View Optimization for Open Relation Extraction
10384 A Property-Guided Diffusion Model for Generating Molecular Graphs
5455 A RAY-TRACING BASED FINGERPRINTING METHOD FOR PASSIVE LOCALIZATION IN URBAN NLOS ENVIRONMENT
8948 A REAL-TIME ACTIVE SPEAKER DETECTION SYSTEM INTEGRATING AN AUDIO-VISUAL SIGNAL WITH A SPATIAL QUERYING MECHANISM
9012 A REAL-TIME LYRICS ALIGNMENT SYSTEM USING CHROMA AND PHONETIC FEATURES FOR CLASSICAL VOCAL PERFORMANCE
7076 A REAL-TIME VIDEO QUALITY METRIC FOR HTTP ADAPTIVE STREAMING
3574 A RECONSTRUCTION-BASED FEATURE ADAPTATION FOR ANOMALY DETECTION WITH SELF-SUPERVISED MULTI-SCALE AGGREGATION
2450 A Reduced-Reference Quality Assessment Metric for Textured Mesh Digital Humans
1463 A Relation-Aware Heterogeneous Graph Transformer on Dynamic Fusion for Multimodal Classification Tasks
7610 A REVIEW OF SELF-SUPERVISED METHODS FOR MUSIC TAGGING
8720 A RIEMANNIAN-BASED JOINT DESIGN FRAMEWORK OF MIMO RADAR TRANSMIT WAVEFORM AND RECEIVE FILTER VIA INFORMATION THEORY
4345 A ROBUST AND SCALABLE METHOD WITH AN ANALYTIC SOLUTION FOR MULTI-SUBJECT FMRI DATA ANALYSIS
4076 A Robust GLRT Detector against Missing Data in Cooperative Sensing


8800 A ROBUST PITCH-FUSION MODEL FOR SPEECH EMOTION RECOGNITION IN TONAL LANGUAGES
4433 A ROBUST QUANTILE HUBER LOSS WITH INTERPRETABLE PARAMETER ADJUSTMENT IN DISTRIBUTIONAL REINFORCEMENT LEARNING
2018 A SALIENCY ENHANCED FEATURE FUSION BASED MULTISCALE RGB-D SALIENT OBJECT DETECTION NETWORK
7436 A SCALABLE SPARSE TRANSFORMER MODEL FOR SINGING MELODY EXTRACTION
6579 A SCANNING LASER OPHTHALMOSCOPE IMAGE DATASET: TOWARDS MULTIPLE FUNDUS DISEASE DETECTION WITH DEEP LEARNING
3439 A SELF-SUPERVISED PRESSURE MAP HUMAN KEYPOINT DETECTION APPROCH: OPTIMIZING GENERALIZATION AND COMPUTATIONAL EFFICIENCY ACROSS DATASETS
9120 A SEPARATION PRIORITY PIPELINE FOR SINGLE-CHANNEL SPEECH SEPARATION IN NOISY ENVIRONMENTS
2383 A SEQUENTIAL AVERAGING PLUG-AND-PLAY METHOD FOR IMAGE RESTORATION VIA FIXED-POINT PROJECTION
2801 A Smoothed Bregman Proximal Gradient Algorithm for Decentralized Nonconvex Optimization
1461 A SOFT CONTRASTIVE LEARNING-BASED PROMPT MODEL FOR FEW-SHOT SENTIMENT ANALYSIS
8613 A SOUND APPROACH: USING LARGE LANGUAGE MODELS TO GENERATE AUDIO DESCRIPTIONS FOR EGOCENTRIC TEXT-AUDIO RETRIEVAL
3887 A Sparse Array Complete Model Error Self-corrected DOA Estimation Based on Atomic Norm
8645 A SPATIAL LONG-TERM ITERATIVE MASK ESTIMATION APPROACH FOR MULTI-CHANNEL SPEAKER DIARIZATION AND SPEECH RECOGNITION


2161 A SPEAKER RECOGNITION METHOD BASED ON STABLE LEARNING
10394 A SPECTRAL ANALYSIS OF GRAPH NEURAL NETWORKS ON DENSE AND SPARSE GRAPHS
2824 A STATISTICAL CHARACTERIZATION OF COMMUNICATION PERFORMANCE IN RIS-AIDED NETWORKS
7297 A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION
1690 A STOCHASTIC GRADIENT APPROACH FOR COMMUNICATION EFFICIENT CONFEDERATED LEARNING
2770 A Stochastic Proximal WMMSE for Ergodic Sum Rate Maximization
10275 A STUDY OF MISPRONUNCIATION DETECTION AND DIAGNOSIS BASED ON META-LEARNING
2806 A STUDY OF MULTICHANNEL SPATIOTEMPORAL FEATURES AND KNOWLEDGE DISTILLATION ON ROBUST TARGET SPEAKER EXTRACTION
3187 A STUDY ON COMBINING NON-PARALLEL AND PARALLEL METHODOLOGIES FOR MANDARIN-ENGLISH CROSS-LINGUAL VOICE CONVERSION
4187 A STUDY ON GRAPH EMBEDDING FOR SPEAKER RECOGNITION
2325 A STUDY ON THE ADVERSE IMPACT OF SYNTHETIC SPEECH ON SPEECH RECOGNITION
9834 A Supervised Information Enhanced Multi-granularity Contrastive Learning Framework for EEG based Emotion Recognition
6974 A TARGETED ADVERSARIAL ATTACK METHOD FOR MULTI-CLASSIFICATION MALICIOUS TRAFFIC DETECTION
4295 A Transformer Approach for Polyphonic Audio-to-Score Transcription


1566 A TRI-DYNAMIC PREPROCESSING FRAMEWORK FOR UGC VIDEO COMPRESSION
5109 A TWO-STAGE DEHAZING FRAMEWORK BASED ON INVERTED IMAGE CURVE-ENHANCEMENT
9417 A TWO-STAGE FRAMEWORK IN CROSS-SPECTRUM DOMAIN FOR REAL-TIME SPEECH ENHANCEMENT
10178 A UNIFIED DNN-BASED SYSTEM FOR INDUSTRIAL PIPELINE SEGMENTATION
1226 A UNIFIED FRAMEWORK FOR MULTI-INTENT SPOKEN LANGUAGE UNDERSTANDING WITH PROMPTING
2091 A UNIFIED FRONT-END FRAMEWORK FOR ENGLISH TEXT-TO-SPEECH SYNTHESIS
7036 A UNIFIED LOSS FUNCTION TO TACKLE INTER-CLASS AND INTRA-CLASS DATA IMBALANCE IN SOUND EVENT DETECTION
1047 A VARIABLE SMOOTHING FOR NONCONVEXLY CONSTRAINED NONSMOOTH OPTIMIZATION WITH APPLICATION TO SPARSE SPECTRAL CLUSTERING
7832 A WASSERSTEIN GRAPH DISTANCE BASED ON DISTRIBUTIONS OF PROBABILISTIC NODE EMBEDDINGS
8952 A weighted-variance variational autoencoder model for speech enhancement
8961 AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS
3666 ACCELERATED RECOVERY OF SPECTRALLY SPARSE SIGNALS VIA MODIFIED PROXIMAL GRADIENT IN HANKEL SPACE
9936 ACCELERATING GRADIENT DESCENT FOR OVER-PARAMETERIZED ASYMMETRIC LOW-RANK MATRIX SENSING VIA PRECONDITIONING
2107 ACCENT-SPECIFIC VECTOR QUANTIZATION FOR JOINT UNSUPERVISED AND SUPERVISED TRAINING IN ACCENT ROBUST SPEECH RECOGNITION
9573 Accurate and Robust Scene Text Recognition via Adversarial Training


6057 ACCURATE GIGAPIXEL CROWD COUNTING BY ITERATIVE ZOOMING AND REFINEMENT
8578 ACCURATE INTERPOLATION OF SCATTERED DATA VIA LEARNING RELATION GRAPH
7271 ACOUSTIC BPE FOR SPEECH GENERATION WITH DISCRETE TOKENS
9014 ACTIVATION COMPRESSION OF GRAPH NEURAL NETWORKS USING BLOCK-WISE QUANTIZATION WITH IMPROVED VARIANCE MINIMIZATION
2333 ACTIVE EXPLAINABLE RECOMMENDATION WITH LIMITED LABELING BUDGET
5967 ACTIVE LEARNING FOR SOUND EVENT CLASSIFICATION USING BAYESIAN NEURAL NETWORKS WITH GAUSSIAN VARIATIONAL POSTERIOR
2933 ACTIVE LEARNING WITH CORE-SET SAMPLING AND SCALE-SENSITIVE LOSS FOR 3D OBJECT DETECTION
7978 ACTIVE NOISE CONTROL OVER 3D SPACE WITH A DYNAMIC NOISE SOURCE
4052 ACTIVE NOISE CONTROL OVER A LARGE REGION WITH MULTIPLE SPHERICAL MICROPHONE ARRAYS IN WAVE DOMAIN
9895 Activity recognition method based on Kernel Supervised Laplacian Eigenmaps
6794 ADAFL: ADAPTIVE CLIENT SELECTION AND DYNAMIC CONTRIBUTION EVALUATION FOR EFFICIENT FEDERATED LEARNING
9832 ADAMER-CTC: CONNECTIONIST TEMPORAL CLASSIFICATION WITH ADAPTIVE MAXIMUM ENTROPY REGULARIZATION FOR AUTOMATIC SPEECH RECOGNITION
1820 AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
5503 ADAPTER-BASED INCREMENTAL LEARNING FOR FACE FORGERY DETECTION


8853 ADAPTING FRECHET AUDIO DISTANCE FOR GENERATIVE MUSIC EVALUATION
4814 ADAPTING LARGE LANGUAGE MODEL WITH SPEECH FOR FULLY FORMATTED END-TO-END SPEECH RECOGNITION
6768 ADAPTING PITCH-BASED SELF SUPERVISED LEARNING MODELS FOR TEMPO ESTIMATION
4508 Adaptive Chroma Block Vector Derivation From Luma for Screen Content Coding
1939 Adaptive Confidence Multi-View Hashing for Multimedia Retrieval
5188 ADAPTIVE DATA AUGMENTATION FOR ASPECT SENTIMENT QUAD PREDICTION
3306 ADAPTIVE FOURIER DECOMPOSITION BASED SIGNAL EXTRACTION ON WEAK ELECTROMAGNETIC FIELD
10109 Adaptive Gaussian Regularization Constrained Sparse Subspace Clustering for Image Segmentation
1537 ADAPTIVE GRID 2-D DIRECTION OF ARRIVAL ESTIMATION METHOD USING AN INTEGRATED DICTIONARY
4698 ADAPTIVE HEAD POSE ESTIMATION WITH REAL-TIME STRUCTURED LIGHT
4588 ADAPTIVE IMAGE-ENHANCED KNOWLEDGE GRAPH COMPLETION
7628 ADAPTIVE JOINT CHANNEL ESTIMATION/DATA DETECTION IN FLEXIBLE MULTICARRIER MIMO SYSTEMS - A TENSOR-BASED APPROACH
3959 ADAPTIVE KALMANNET: DATA-DRIVEN KALMAN FILTER WITH FAST ADAPTATION
2075 ADAPTIVE MULTI-ARMED BANDIT LEARNING FOR TASK OFFLOADING IN MOBILE EDGE COMPUTING
8653 Adaptive Multi-Exposure Fusion for Enhanced Neural Radiance Fields
7636 ADAPTIVE MULTIVIEW COMMUNITY-PRESERVED GRAPH CONVOLUTIONAL NETWORK FOR MULTIATLAS-BASED FUNCTIONAL CONNECTIVITY ANALYSIS
3672 Adaptive Multi-View Joint Contrastive Learning on Graphs
10002 ADAPTIVE ORDER AGGREGATOR AND EXTRACTOR GRAPH NEURAL NETWORK
4135 Adaptive parameter sharing for multi-agent reinforcement learning
8588 ADAPTIVE PEDESTRIAN TRAJECTORY PREDICTION VIA TARGET-DIRECTED ANGLE AUGMENTATION
7010 ADAPTIVE QUANTIZATION WITH MIXED-PRECISION BASED ON LOW-COST PROXY
7844 Adaptive Reweighted Sparse Belief Propagation Decoding for Polar Codes
8655 ADAPTIVE SECONDARY TRANSFORM SETS FOR VIDEO CODING BEYOND AV1
5443 Adaptive Sensor Selection With Deterministic Priors for DoA Tracking
8681 ADAPTIVE SPATIAL-TEMPORAL HYPERGRAPH FUSION LEARNING FOR NEXT POI RECOMMENDATION
5017 ADAPTIVE SPEECH EMOTION REPRESENTATION LEARNING BASED ON DYNAMIC GRAPH
8672 Adaptive Super Resolution For One-Shot Talking-Head Generation
3635 ADAPTIVE VIDEO WATERMARKING WITH PERCEPTUAL GUARANTEE AND EFFICIENCY OPTIMIZATION
7216 ADAPTIVE-AVG-POOLING BASED ATTENTION VISION TRANSFORMER FOR FACE ANTI-SPOOFING
8313 ADDRESSING CONFOUNDS IN FUNCTIONAL CONNECTIVITY ANALYSES OF CALCIUM IMAGING
7480 Addressing Data Scarcity In Voice Disorder Recognition with Self-Supervised Models
5478 ADHD DIAGNOSIS AND BIOMARKER DETECTION BASED ON MULTIMODAL GRAPH CONVOLUTIONAL NEURAL NETWORK


2191 ADIFT: ZERO-SHOT GENERATIVE MODEL ADAPTION VIA ADAPTIVE DOMAIN-INVARIANT FEATURE TRANSFER
4307 Advancing Acoustic Howling Suppression through Recursive Training of Neural Networks
8316 Adversarial Domain Adaptation for Classification with Nested Dichotomies
9652 ADVERSARIAL JAMMING FOR AUTOENCODER DISTRIBUTION MATCHING
4633 ADVERSARIAL LEARNING ON COMPRESSED POSTERIOR SPACE FOR NON-ITERATIVE SCORE-BASED END-TO-END TEXT-TO-SPEECH
9094 Adversarial Robustness of Convolutional Models Learned in the Frequency Domain
6368 ADVERSARIAL SPEECH FOR VOICE PRIVACY PROTECTION FROM PERSONALIZED SPEECH GENERATION
4062 ADVSHADOW: EVADING DEEPFAKE DETECTION VIA ADVERSARIAL SHADOW ATTACK
3141 ADVSV: AN OVER-THE-AIR ADVERSARIAL ATTACK DATASET FOR SPEAKER VERIFICATION
8559 AdvTTS: Adversarial Text-to-Speech Synthesis Attack on Speaker Identification Systems
8708 AEAM3D:Adverse Environment-Adaptive Monocular 3D Object Detection via Feature Extraction Regularization
8391 AEGIS-Net: Attention-guided Multi-Level Feature Aggregation for Indoor Place Recognition
8790 Aerial-IRS-Assisted Load Balancing in Downlink Networks
TNS AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
4994 AG-LSEC: AUDIO GROUNDED LEXICAL SPEAKER ERROR CORRECTION
2588 AHRNet: Attention and Heatmap-based Regressor for Hand Pose Estimation and Mesh Recovery


7621
4678
3362
5296
3709
4142
6831
5255
3826
7430
8738
10024
9668
8453


AINUR: HARMONIZING SPEED AND QUALITY IN DEEP 
MUSIC GENERATION THROUGH LYRICS-AUDIO 
EMBEDDINGS 


ALIGN, ADAPT AND INJECT: AUDIO-GUIDED IMAGE 
GENERATION, EDITING AND STYLIZATION 


All Neural Kronecker Product Beamforming for Speech 
Extraction with Large-scale Microphone Arrays 


ALLEVIATING HALLUCINATIONS VIA SUPPORTIVE 